71 research outputs found

    Intelligent system for spoken term detection using the belief combination

    Get PDF
    Spoken Term Detection (STD) can be considered as a sub-part of the automatic speech recognition which aims to extract the partial information from speech signals in the form of query utterances. A variety of STD techniques available in the literature employ a single source of evidence for the query utterance match/mismatch determination. In this manuscript, we develop an acoustic signal processing based approach for STD that incorporates a number of techniques for silence removal, dynamic noise filtration, and evidence combination using Dempster-Shafer Theory (DST). A ‘spectral-temporal features based voiced segment detection’ and ‘energy and zero cross rate based unvoiced segment detection’ are built to remove the silence segments in the speech signal. Comprehensive experiments have been performed on large speech datasets and consequently satisfactory results have been achieved with the proposed approach. Our approach improves the existing speaker dependent STD approaches, specifically the reliability of query utterance spotting by combining the evidences from multiple belief sources

    Flight Guardian: Autonomous Flight Safety Improvement by Monitoring Aircraft Cockpit Instruments

    Get PDF
    During in-flight emergencies, a pilot’s workload increases significantly and it is often during this period of increased stress that human errors occur that consequently diminish the flight safety. Research studies indicate that many plane crashes can be attributed to ineffective cockpit instrument monitoring by the pilot. This manuscript entails the development of Flight Guardian(FG) system being first of its kind that aims to provide efficient flight-deck awareness to improve flight safety while assisting the pilot in abnormal situations. The system is intended to be used in older aircrafts that cannot easily or cost effectively be modified with modern digital avionic systems. One of the important feature of FG system being not physically connected to the aircraft, avoids any impact on airworthiness or the need for re-certification. For the first time, a composite of techniques including video analysis, knowledge representation, and machine belief representations are combined to build a novel flight-deck warning system. The prototype system is tested in both; simulation based Lab and real flight environments under the guidance of expert pilots. The overall system performance is evaluated using statistical analysis of experimental results that proved the robustness of proposed methodology in terms of automated warning generation in hazardous situations

    Novel hybrid object-based non-parametric clustering approach for grouping similar objects in specific visual domains

    Get PDF
    Current widely employed clustering approaches may not yield satisfactory results with regard to the characteristics and distribution of datasets and number of clusters to be sought, especially for visual domains in multidimensional space. This study establishes a novel clustering methodology using a pairwise similarity matrix, Clustering Visual Objects in Pairwise Similarity Matrix (CVOIPSM), for grouping similar objects in specific visual domains. A dimensionality reduction and feature extraction technique, along with a distance measuring method and a newly established algorithm, Clustering in Pairwise Similarity Matrix (CIPSM), are combined to develop the CVOIPSM methodology. CIPSM utilizes both Rk-means and an agglomerative, contractible, expandable (ACE) technique to calculate a membership degree based on maximizing inter-class similarity and minimizing intra-class similarity. CVOIPSM has been tested on several datasets, with average success rates on downsized subsamples between 87.5\% and 97.75\% and between 81\% and 87\% on the larger datasets. The difference in the success rates for small and large datasets is not statistically significant (p>0.01). Moreover, this method automatically determines the likely number of clusters without any user dictation. The empirical results and the statistical significance test on these results ensure that CVOIPSM performs effectively and efficiently on specific visual domains, disclosing the interrelated patterns of similarities among objects

    Automated Aircraft Instrument Reading Using Real Time Video Analysis

    Get PDF
    Automated Dial Reading (ADR) using image processing is a challenging task that has to deal with the dynamics of real time environment. Literature contains limited research work for ADR that is based on background subtraction, object tracking, and pattern recognition. These methods suffer from dynamic environment such as: varying light intensity, poor resolution, and vibrations in capturing device. A valuable contribution to the existing dial reading approaches is made in this paper by deploying convolution method which plays a significant role in needle/hand recognition within a dial. Proposed dial reading approach is successfully used and tested reading analogue aircraft instruments facilitated by the Flight Guardian (FG) project for automated reading of the cockpit devices in dynamic environments. Performance is evaluated by statistical analysis of the experimental results that proved the robustness of the proposed method

    A framework for the synergistic integration of fully autonomous ground vehicles with smart city

    Get PDF
    Most of the vehicle manufacturers aim to deploy level-5 fully autonomous ground vehicles (FAGVs) on city roads in 2021 by leveraging extensive existing knowledge about sensors, actuators, telematics and Artificial Intelligence (AI) gained from the level-3 and level-4 autonomy. FAGVs by executing non-trivial sequences of events with decimetre-level accuracy live in Smart City (SC) and their integration with all the SC components and domains using real-time data analytics is urgent to establish better swarm intelligent systems and a safer and optimised harmonious smart environment enabling cooperative FAGVs-SC automation systems. The challenges of urbanisation, if unmet urgently, would entail severe economic and environmental impacts. The integration of FAGVs with SC helps improve the sustainability of a city and the functional and efficient deployment of hand over wheels on robotized city roads with behaviour coordination. SC can enable the exploitation of the full potential of FAGVs with embedded centralised systems within SC with highly distributed systems in a concept of Automation of Everything (AoE). This paper proposes a synergistic integrated FAGV-SC holistic framework - FAGVinSCF in which all the components of SC and FAGVs involving recent and impending technological advancements are moulded to make the transformation from today's driving society to future's next-generation driverless society smoother and truly make self-driving technology a harmonious part of our cities with sustainable urban development. Based on FAGVinSCF, a simulation platform is built both to model the varying penetration levels of FAGV into mixed traffic and to perform the optimal self-driving behaviours of FAGV swarms. The results show that FAGVinSCF improves the urban traffic flow significantly without huge changes to the traffic infrastructure. With this framework, the concept of Cooperative Intelligent Transportation Systems (C-ITS) is transformed into the concept of Automated ITS (A-ITS). Cities currently designed for cars can turn into cities developed for citizens using FAGVinSCF enabling more sustainable cities

    Pupil Localisation and Eye Centre Estimation using Machine Learning and Computer Vision

    Get PDF
    Various methods have been used to estimate the pupil location within an image or a real-time video frame in many fields. However, these methods lack the performance specifically in low-resolution images and varying background conditions. We propose a coarse-to-fine pupil localisation method using a composite of machine learning and image processing algorithms. First, a pre-trained model is employed for the facial landmark identification to extract the desired eye-frames within the input image. We then use multi-stage convolution to find the optimal horizontal and vertical coordinates of the pupil within the identified eye-frames. For this purpose, we define an adaptive kernel to deal with the varying resolution and size of input images. Furthermore, a dynamic threshold is calculated recursively for reliable identification of the best-matched candidate. We evaluated our method using various statistical and standard metrics along-with a standardized distance metric we introduce first time in this study. Proposed method outperforms previous works in terms of accuracy and reliability when benchmarked on multiple standard datasets. The work has diverse artificial intelligence and industrial applications including human computer interfaces, emotion recognition, psychological profiling, healthcare and automated deception detection
    • …
    corecore